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High-throughput experimental techniques and bioinformatics tools make it possible to 
obtain reconstructions of the metabolism of microbial species. Combined with mathemat¬ 
ical frameworks such as flux balance analysis, which assumes that nutrients are used so as 
to maximize growth, these reconstructions enable us to predict microbial growth. Although 
such predictions are generally accurate, these approaches do not give insights on how differ¬ 
ent nutrients are used to produce growth, and thus are difficult to generalize to new media 
or to different organisms. Here, we propose a systems-level phenomenological model of 
metabolism inspired by the virial expansion. Our model predicts biomass production given 
the nutrient uptakes and a reduced set of parameters, which can be easily determined exper¬ 
imentally. To validate our model, we test it against in silico simulations and experimental 
measurements of growth, and find good agreement. From a biological point of view, our 
model uncovers the impact that individual nutrients and the synergistic interaction between 
nutrient pairs have on growth, and suggests that we can understand the growth maximization 
principle as the optimization of nutrient synergies. 
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I. INTRODUCTION 

The rapid development of high-throughput experimental teehniques and bioinformaties tools 
has made it possible to obtain reliable metabolic reconstructions from genomic data in a semi¬ 
automatic fashion HBa. The availability of such reconstructions makes it possible, in turn, to 
investigate metabolism from a systems point of view dH . In particular, the development of a math¬ 
ematical framework to predict cellular growth based on cellular function optimization has signif¬ 
icantly advanced our understanding of how the metabolic state of an organism will change upon 
modifications in the growth medium, the introduction of mutations, or the effect of stress [l6]-fT^. 

Unfortunately, our ability to calculate microbial growth rates has not been paralleled by a sub¬ 
stantial gain of insight into metabolic processes, especially for what concerns the impact of nu¬ 
trients on growth. A number of mathematical models have been developed aiming at predicting 
microbial growth rates [fT3l - fT^ . but these models are only valid for a limited number of specific 
nutrients and are not easily generalizable because of the need to determine parameters empirically. 

Here, we present a systems-level phenomenological model that enables us to predict growth 
and, at the same time, provides insights into the effective systems-level principles by which nutri¬ 
ents are catabolized. Our approach does not predict which nutrients will be uptaken from a given 
medium; rather, it predicts, from the values of the uptakes, how each nutrient will contribute to 
cellular growth. Despite the fact that we use flux balance analysis (FBA) to develop, justify and 
validate our model (and that, as we discuss later in Section |IV| FBA has well known limitations), 
the model is ultimately independent of FBA and of any particular metabolic reconstruction; in this 
sense, the model is also organism-independent. 

Our approach, which is analogous to a virial expansion, reveals that cellular growth can be 
well-approximated by the contributions of each individual nutrient plus a synergy term that con¬ 
siders nutrient-pair contributions. We demonstrate that the predictions of the model are in good 
agreement with empirical measurements of biomass production. Moreover, our model provides 
novel insight into the effective contributions to growth since we can express synergy contributions 
as scaling functions that depend exclusively on four factors: the type of nutrients considered, the 
pathways that catabolize them, the ratio between their uptake fluxes, and the effective carbon con¬ 
tent of each nutrient. Uptake fluxes are allocated among possible synergistic contributions in order 
to maximize synergy, thus revealing the principles of nutrient use that lead to the maximization of 
biomass production. 


II. MODEL 

Our goal is to express in closed form the steady-state growth rates of a bacterium given the 
nutrient uptakes from the external medium, without taking explicitly into account any micro-level 
information about the processes occurring inside the cell. In lfT9l and Il20ll . models to predict which 
nutrients can produce growth and what constraints are necessary to reproduce observed uptakes in 
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rich media were already developed. Here, we consider that the real uptake fluxes of each nutrient 
are known and fall within the empirical range which ensures that nutrient uptakes can be fully 
catabolized flU. 

To validate our model, we use FBA predictions of biomass production for Escherichia coli 
using the metabolic reconstruction iAF1260, which has been shown to yield a good agreement 
with empirically measured growth rates [l^ . Note that we focus exclusively on the use of nutrients 
for biomass polymerization, discarding the role of ATP maintenance (see [fT9l and Sec. |IV]). For 
simplicity, we focus on nutrients that belong to one of the four main nutrient classes: sugars, fatty 
acids, amino acids, and bases (see Appendix for a complete list). 

Following a virial expansion-like formulation, we hypothesize that, given a fixed vector of 
nutrient uptake fluxes 0 , we can express the steady-state biomass production of an organism as 

E E 

i=l j<k 

+ ^ 4>j-,4>k) + • ■ ■ , 

i<j<k 

where E is the number of uptakes. 

A first order approximation is equivalent to considering that each single nutrient contributes 
independently to g{4>) as in [[T9l . In analogy to the ideal gas approximation, we call this model 
idealized metabolism (IM). Note that because we consider the nutrient use for stationary biomass 
production exclusively, in the presence of a single nutrient uptake {i.e. (j)i 0 for a single i 
and = 0 for k ^ i) the scale of our system is precisely given by 0,. Therefore, the biomass 
production must be proportional to 0 *, so that where a is the biomass yield of nutrient 

i [l9l[T3|22l. For the first order terms, we thus write: 

E E 

i=l i=l 

We evaluate dj for each nutrient i by computing the FBA biomass production 5 'fba( 0*'*^) allowing 
for a single nutrient uptake 

0 j = 1 arb.units, 

I 0j = 0 arb.units Vj 7 ^ 0 

where we use arbitrary units, since all fluxes are defined up to a multiplicative constant in the 
FBA problem. Note that in Eq. Q, only purines among bases can be accounted for growth, 
since pyrimidines alone cannot be catabolized by E. coli ifT^ . Previously, we found that d* is 
proportional to the effective number of carbons C*, that is, the number of carbons that are actually 
catabolized Q in each metabolite i as 

d^ (3) 

^ For the nutrient classes we consider, the effective carbons equal the actual carbons for all nutrients except for the 
bases 
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with a slope Oc that is nearly insensitive to the nutrient elass c (fatty aeids, sugars, amino aeids, 
Fig. [^). Here, both the veetor & and the slopes Uc are dimensionless quantities. 

To assess the aeeuraey of the IM, we eompare the predietions of the model against FBA eal- 
eulations for the growth of E. coli on random eomplex media with a fixed number of non-zero 
nutrient uptakes (Methods). Beeause g is defined up to a multiplieative eonstant, the largest the 
total uptake, the largest the biomass produetion. We thus eonsider eomplex uptake veetors nor¬ 
malized to 1, to mimie physiologie eonditions. However, we note that we would obtain the same 
relative errors for a fixed number of uptakes if we eonsidered non-normalized fluxes. 

Figure shows that despite its simplieity, the idealized model is fairly aeeurate, with a relative 
error, A := , ranging from ~ 0-2% for one nutrient to 24% for 20 uptakes. Note that 

using Eq. Q to prediet growth lightly overestimates single nutrient eontributions to growth, as 
the eorresponding A for growth on one nutrient shows. This effeet however is negligible when 
inereasing the number of uptakes above E > 5. It is also apparent that the IM systematieally 
underestimates FBA predietions for media with E >2 nutrients, whieh implies that when several 
nutrients are present, they eontribute synergistieally to growth. 


III. RESULTS 

A. Scaling of second order terms 

In order to eapture nutrient growth synergies, we eonsider next the seeond order terms in 
Eq. Q. Using EBA, we numerieally determine (3ij by setting to zero all entries of the exehange 
fluxes exeept (pi and cpj and eomputing the differenee 

(3ij{(pi, (pj) = gFBA{<P^'’^'*)-ai(pi - ajcpj, (4) 

where is the veetor <p sueh that cpk = OWk ^ i, j (Eig. H). 

Sinee there is only one output in our system (biomass), the seale of of g is fixed by one of 
the uptake fluxes (for instanee (pj) and the dependeney on the remaining uptake fluxes ean be 
expressed as dimensionless quantities, whieh are ratios of uptake fluxes. As a eonsequenee, we 
expeet (3 to obey a sealing property (Eig. [^): 

cpj) = t^ij = Ai • (5) 

Remarkably, we find that j3 displays additional sealing properties. Eor eonereteness, eonsider the 
synergy between sugars and fatty aeids. We found that the (3 funetions for any sugar-fatty aeid 
pair (Eig. |^) eollapse on the same eurve when the sugar and the fatty aeid uptake fluxes (pi, (pj are 
resealed with respeet to the effeetive number of earbons Ci, Cj of the eorresponding nutrient (Eig. 
[^). One thus has 

Aug,f_acid(^) = 



( 6 ) 
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so that the introduction of the rescaled /?' function allows to have a systematic description of 
growth only given the nutrient-pair classes, their carbon content and the ratio of their uptake fluxes. 
For each nutrient-class pair a, a' it is therefore possible to define a function that displays a 
simple two-regime behavior (Fig. [^), in which one of the nutrients becomes the limiting factor 
in the contribution to growth. Considering again the case of sugars and fatty acids, when the ratio 
iCj(j)j) —)■ 0 the function /S'ug.f acid grows linearly, while when Ci(j)i/{Cj(j)j) S> 1 it reaches 
a plateau. To capture these two regimes, we propose the generalized phenomenological model: 


where 



(7) 


<Pi/(Pj—^OO 


( 8 ) 


Here at and aj are the classes of nutrient i, j, respectively, while 6^.^^ and b^^ai are dimensionless 
parameters, since they are defined as a flux ratio. These parameters can be interpreted as the limit¬ 
ing synergistic contribution to the biomass yield when one of the two nutrients is in excess of the 
other. In this formulation, knowing the limiting contributions is thus enough to compute the syn¬ 
ergistic contribution to growth of any sugar-fatty acid pair and for any value of the uptake fluxes. 
For instance, the transition value T(sug, f_acid) = 6, .. /b , marks the relative sugar-fatty 
acid uptake values at which maximal synergy may be attained without waste of nutrients. 

Figure shows the averaged collapsed curves for all nutrient class pairs we consider. Our 
calculations indicate that Eq. Q is a fairly good description for such averaged /?', although we 
note that for each nutrient class pair (3' has different parameters (see Table |T] and Appendix for 
a summary of the averaged parameters for each one of these curves). Note that, for nutrients in 
the same class, it is not necessary to consider all pair permutations. One can, for instance, sort 
nutrients in a given class a by their carbon content and evaluate the parameters b^^j only between 
parrs i, j such that Ci < Cj. This is the approach we follow in evaluating the parameters 
which, as a consequence, are not symmetric when a = a'. 

The phenomenological model in Eq. (|7]) captures very well the behavior of (3' for 4 of the 9 
cases: (fatty acid, sugar), (fatty acid, fatty acid), (base, sugar) and (base, base) pairs (Eigs. |^, d, 
b, and g)|^ Eor the (base, fatty acid) case (Eig. [^e), we find that the phenomenological model in 
Eq. Q does not fully capture the behavior of the averaged (3' (see Appendix). In such case we still 
find that [3' is roughly linear for 0i/02 ^ 1 and shows a plateau when 0i/02 ^ 1, as predicted 
by Eq. Q. However, for Ci^i/(C202) — 1, the model overpredicts the observed synergy. Despite 


^ Note that nutrients in the same class are ordered with their carbon content and pair permutations are not considered. 
Thus in /3'(^), (pi always corresponds to the nutrient with the smaller number of carbons. This implies that, for 
the (3' within the same class, the average slope and plateau values are not equal (see Table [^. We also remind that, 
in the base-base pair case, we only consider pairs of purines as E. coli cannot catabolize pyrimidines by themselves. 
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this deviation, Eq. (|7]) is a good trade off between model simplieity and predietive power, sinee the 
initial slope of j3' and the plateau value are well predieted by taking the average of the parameter 
b over all nutrient pairs. 

Finally, for all pairs ineluding amino aeids (Figs. |^e,f, h, and i), we find that not all eurves 
eollapse into a single one. In partieular, we see that when Mother/0a.acid S> 1 ({other: sugar, 
f_acid base, a_acid}), the sealing funetions reaeh different plateau values, whieh always lie either 
above or below a 10“^ threshold value, respeetively. Interestingly, for interelass interaetions, any 
given amino aeid eonsistently reaehes a plateau above or below sueh threshold independent of the 
other nutrient paired with it. We henee elassify amino aeids into two groups, L (Fow synergy), 
H (High synergy), aeeording to whether they ean attain a synergy below or above the mentioned 
10“^ threshold, for interelass synergies. For amino aeid-amino aeid interaetions, we thus divide 
nutrients into H and L and study intraelass/F-H synergies. This allows us to find two slope and 
plateau values respeetively, eaeh related to the H or L amino aeid limiting the interaetion in turn. 

Using a logistie regression model, we find that the set of metabolie pathways in whieh an amino 
aeid partieipates determines to whieh group {H or L) it belongs (see Appendix). By minimizing 
the Bayesian Information Criterion [l23ll . we see that knowing whether the amino aeid partieipates 
in the set of six pathways listed in Table|I^is enough to eorreetly assign all amino aeids exeept MD- 
Methionine to either group H or L. Onee the eorresponding group is known, we ean use Eq. (|7]) 
to deseribe (3' by allowing two plateau values when the nutrient pair involves an amino aeid. In 
this way, we ean have elose estimates of synergies through the funetion Eq. ([^ for nutrients pairs 
from all elasses, by only knowing their elass and the pathways in whieh they partieipate. 


B. Competition for synergistic potentials 

When a baeterium grows on a eomplex medium with E > 2 nutrients, Eq. Q yields a sum 
over E{E — l)/2 synergy eontributions resulting in an overpredietion of the biomass produetion 
(see Appendix). The reason for this is that resourees are limited by stoiehiometry, thus besides 
the independent nutrient eontribution to growth of eaeh uptake 0i, resourees must be distributed in 
some way among the E — 1 possible synergies. Two plausible flux alloeations are the following: 
i) an equitative distribution of all {0,} among the synergies (equitative synergy model, ES); ii) a 
distribution among synergies that yields maximal synergy, whieh we eall optimal synergy model 
(OS). We find that while the former underprediets growth rates when inereasing the number of 
uptakes, the latter yields an aeeurate predietion of FBA growth rates roughly independent of the 
number of nutrients (Fig. |^and Appendix). Our results thus suggest that, phenomenologieally, 
one ean understand the growth maximization prineiple observed in mierobes as the optimization 
of nutrient synergies. 

The OS theory exploits the faet that synergy eontributions are limited by the smallest uptake 
flux Eq. (|^, so that only the nutrients in exeess ean be used in other synergies. In order to maximize 
the overall synergy, we hypothesize that an optimal alloeation of nutrients is adopted to produee 
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the largest pair-synergies. We thus rank nutrient-pair synergies and add up to the total synergy 
eaeh eontribution. After eaeh addition, the fluxes of the pair are resealed sueh that the limiting one 
is not eonsidered further, while the nutrient in exeess ean eontribute to other synergies with the 
fraetion of uptake not invested yet (Methods). 

In a eomplex growth medium with E non-zero nutrient uptakes, we thus express the OS growth 
rate as follows: 


E 

9{4>) = 

i=l 

p 

(«:j) = l 




(9) 


where the seeond sum runs over the P = E{E — l)/2 ranked pairs of nutrients, is the ranking 
of the nutrient pair synergy (k, j), and G [0,1] indieates the fraetion of uptake flux 0^ yet 
to be alloeated to this eontribution. As before, Ci is the effeetive number of earbons of nutrient 
i and ai is the nutrient elass to whieh nutrient I belongs, and eoeffieients b have been reported 
in Table The yields ean either be direetly evaluated for eaeh nutrient, or eomputed as in 
equation Eq. Q, with parameters a reported in Fig. [^b. Note that, when available it is preferable 
to use the exaet a when dealing with less than 4 nutrients, beeause Eq. Q slightly overprediets 
single nutrient eontributions to growth in this ease (this effeet however vanishes when dealing with 
E > 5 nutrients). 

Finally, we eompare the biomass produetion predietions of our OS model Eq. Q against FBA 
predietions for E. coli in media with a fixed number of non-zero random nutrient uptakes normal¬ 
ized to 1 (Methods). 

Figure]^ shows the OS model is able to prediet with high aeeuraey the growth rates eomputed 
by using FBA assuming known uptakes. The average relative error A := eomputed 

over 500 different random growth media with fixed number of uptakes is systematieally smaller for 
OS model predietions than for those of the IM. Notably, the gap between the two models inereases 
with the number of uptakes, due to the more synergistie eontributions that are being negleeted by 
the IM model. 

Sinee sugars are the main souree of earbons and are quite eommonly ineluded in experimental 
growth media, to reproduee these media we always allow the uptake of one sugar. For more 
random nutrient setups we find A of the OS to be slightly larger, but still eonsistently smaller than 
the IM theory (see Appendix). 


C. Comparison with experiments 

After validating our model in silico, we test here how well the OS model prediets aetual growth 
rates in vivo. To do so, we eompare our model with experimental measurements of nutrient uptakes 
and growth for baeterial eulture on eomplex media. Note that obtaining sueh type of data is 
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generally not straightforward as measurement of multiple uptakes is typieally hard. Additionally, 
to date, standard experiments used to validate FBA generally foeus on the simpler ease of growth 
media with a single souree of earbon. Nevertheless, a very interesting study on eomplex media 
where baeterial growth rate and variation of nutrient eoneentration are measured was published by 
Beg et al. [l20ll . The authors performed there some E. coli bateh eulture experiments that allowed 
them to estimate those quantity simultaneously as a funetion of time. From their published data, we 
were able to reeover the nutrient uptakes eorresponding to every measured growth rate (Appendix) 
and to use sueh uptakes as inputs in our model. This approaeh allowed us in turn to eompare the 
predieted growth rate with the experimental one. 

The results are reported in Fig. where we eompare OS model predietions with the experi¬ 
mentally measured growth rates. Note that now that physiologieal uptake and growth values are 
measured, we ean use proper mmol gDW“^h~^ units for the former and h~^ for the latter. When 
doing so, model Eq. (|9]) reaehes a remarkable aeeuraey, espeeially taking into aeeount that i.) the 
E. coli strain in the experiments differs from the reeonstruetion at our disposal and ii.) we used the 
b and a parameters we derived by ealibrating the model with FBA, rather than estimating them ad 
hoc, thus highlighting the broad applieability of our model. 

The exeellent agreement we found between the growth predieted by our model and the aetual 
growth on a eomplex medium supports that sealing and synergy really are two prineiples regulating 
mierobial growth in vivo besides their role in modeling metabolism in silico. 


IV, DISCUSSION: SCOPE AND POTENTIAL LIMITATIONS OF OUR APPROACH 

We have used FBA predietions under growth optimization as a reliable souree of growth rates, 
that is, as a substitute for growth experiments with real baeteria. Thus, even though our model is 
ultimately independent of FBA (in that Eq. Q does not rely in any way on EBA or on any partieular 
metabolie reeonstruetion), one may argue that our model is suseeptible to suffer the shorteomings 
of EBA. Here we diseuss these shorteomings, although the eomparison to experimental data in 
Eig. 1^ demonstrates that, whatever limitations EBA may have, our model is able to reproduee 
experimental growth rates in a variety of realistie eonditions. 

The first issue is the determination of the so-ealled ATP maintenanee flux. This is an additional 
reaetion flux that EBA adds to the set of metabolie reaetions and eonstraints to reproduee the 
experimental growth rates. Sueh ATP flux eneompasses a series of external faetors that affeet 
mierobial growth rates, sueh as the uptake rate of nutrients, oxygen availability, and regulation or 
temperature. But although ATP maintenanee rates obtained for a speeifie minimal medium have 
been shown to reproduee aeeurate results in different growth eonditions for eertain organisms 
im, it eannot be assumed that speeifie values are valid to make predietions for different growth 
eonditions in general. To overeome this, we proeeed as in [[T9l and first evaluate the ATP needed 
for the polymerization of biomass eomponents by using the values experimentally determined 
(whieh are available in the literature [l24l|25l) and then fix the ATP maintenanee to this baseline. 
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removing any further ATP maintenanee eontribution. In any ease, it is always possible to reseale 
our findings a posteriori in the same way ATP maintenanee is fitted within the FBA approaeh. 
Moreover, Fig. [^suggests that the effeet of the maintenanee flux is not very relevant. 

Another eaveat of FBA is that it systematieally prediets the simultaneous uptake of different 
sugars, while it is known that mierobes absorb their preferred sugar first [|2^ . For this reason 
FBA will regularly over-prediet biomass produetion in presenee of multiple sugars IITTTI . In our 
approaeh this is mostly irrelevant beeause we are eoneerned with determining growth given the 
uptakes of nutrients. In any event, to avoid validating our model against unrealistie settings, we 
foeus on eomplex growth media eontaining a single sugar (Methods and Appendix). 

Finally, it has been empirieally demonstrated that under eertain eonditions, unieellular organ¬ 
isms do not strietly follow a maximal growth prineiple ifT^ . However, it has also been shown that 
in many oeeasions the metabolie state predieted by growth maximization is very similar to that 
of the maximization of other funetions ffm, so that our formalism eould be applieable to these 
eonditions. 


V. CONCLUSIONS 

In this work, we present a seeond order phenomenologieal model of metabolism that, by relying 
on a very limited set of parameters, is able to prediet the biomass produetion of E. coli in arbitrary 
eomplex growth media within 1% of the aetual value for growth in silico and with great aeeuraey 
for growth in vivo. 

Our model shows that nutrients within the same elass are effectively eatabolized in a similar 
manner, so that the eontribution to growth in the presenee of a given nutrient is fully determined 
by the nutrient’s effeetive earbon eontent and the elass it belongs to. We find that the synergy 
developed by the uptake of several nutrients inereases the eatabolie potential of the metabolie 
network. Sueh synergy between nutrients pairs depends on the relative abundanee of the nutrients 
and is eapped by the less abundant nutrient. 

Our model shows that, effeetively, nutrient eontributions to growth ean be well approximated 
by the sum of the independent eontribution of eaeh nutrient and a synergy eontribution. The syn¬ 
ergy eontribution depends exelusively on nutrient pair synergies so that uptake fluxes are alloeated 
among pair synergies in order to maximize the synergy eontribution with the available resourees. 
In this way, the funetion maximization prineiple (usually growth) that determines the metabolie 
state of a unieellular organism ean be effeetively understood as the optimization of nutrient syner¬ 
gies. 


METHODS 


Random flux uptakes generation 

For each fixed number of uptakes E, we generate a vector cf) of uptake fluxes that allows the bacterium 
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to catabolize a combination of fatty acids, amino acids and bases, plus one sugar only. To do so, only 
one of the entries of cj) that do correspond to sugar uptakes is chosen uniformly at random to have a value 
different from zero. Such value is uniformly drawn at random in the range (0,1) arb.units. All E — 1 
remaining uptakes are uniformly chosen at random among entries of 4> that do not correspond to a sugar. 
Again, the flux value is drawn in the range (0,1) arb.units. After all the E nonzero entries of 0 are drawn, 
we normalize the uptakes so that the total uptake is always equal to one (see Appendix for results in other 
complex media). 

Optimal synergy model 

Suppose we want to compute the growth of a vector (j) of uptake fluxes with E non-zero entries according 
to the OS model Eq. Q. 

In order to allocate the uptake of fluxes to maximize synergy we proceed as follows. First, we compute 
all £'(£' — l)/2 synergies f3' and rank them according to their corresponding contributions to growth from 
largest to smallest. Starting from the largest, we evaluate which nutrient in the pair (ni, n 2 ) is in excess 
by comparing the flux ratio Cni 4 >nT_/{Cn 24 >n 2 ) to to® transition value r(ni,n 2 ) = bn 2 ni/bnin 2 of to® 
corresponding /3' function. For instance, if C'n^^„i/(C'n 2 </’n 2 ) < ^ 2 ); ^2 is in excess. We then store 

this contribution, set the limiting flux to zero and reduce (/>„2 by its distance from the transition value as 
4>n2 —^ 4>n2 — C'ni/C'n 2 '/’niT'(ni, 712 ). Note that this implies that is not used in other synergies. All the 
other fluxes are kept constant. These updated fluxes are used to re-compute the synergies occupying lower 
positions in the rank, and the process is repeated for the second largest fi'. In this way synergies at position 
k in the rank are computed with effective fluxes that take into account both the limitedness of 

resources and their optimal routing. 

A slightly different version of our approach, where ranking of synergies is computed after each step 
(j)^ —)• is not as accurate as the protocol described above (see Appendix and fig. |^. 
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Figure 1: Idealized metabolism theory, (a) The a parameters introduced in Eq. (|^, versus the number of effective 
carbons for each of the nutrients considered in our study. We consider nutrients in four groups: sugars, fatty acids, 
bases and amino acids. The a coefficients are a linear function of the effective number of carbons whose slope depends 
very weakly on the nutrient class, except for bases (see panel b). The dashed lines show linear fits for each class of 
nutrients, while the black dotted line is a fit considering all of them together, (b) The coefficients Oc introduced in 
Eq. We show the values of Oc obtained from the fits shown in panel a). Qc varies weakly with nutrient class, (c) 
Predictions of the idealized metabolism theory, Eq. Q, versus EBA results for a selection of 100 random media with 
increasing number of possible uptakes (see Methods). Eilled red circles correspond to using exact a values, while 
empty blue squares to Eq. (j^. (d) The relative error A = of the IM theory predictions for the two 

different choices of a averaged over 500 random media, for increasing number of uptakes. A is relatively small in 
presence of a few nutrients only, but it increases roughly linearly. Note that the error performed when using Eq. 0 
in presence of one nutrient only is different from zero, meaning that Eq. ([^ does not correctly capture single nutrient 
contributions to growth. This effect however is negligible increasing the number of nutrients, as the two A curves 

overlap. 
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Figure 2: Scaling of nutrient synergy contributions, (a) The function /?, Eq. (|4|i, that expresses the gap between the 
linear model predictions Eq. Q and the EBA results for the growth rate of E. coli, when there are two nutrient uptakes 
different from zero. We show here the simultaneous uptake of dodecanoate and butyrate (both fatty acids) as a typical 
example. ^ is a growing function of the exchange fluxes of both nutrients. The circles and crosses correspond to the 
two (example) curves that are shown, once rescaled, in panel (b). (b) Scaling property of ( 3 , Eq. 0- We plot the same 
data points of panel (a); each curve shows l3/(j)2 as a function of (j)i/(j )27 for two different fixed values of (j)i. Such 
normalization allows to collapse all points on the same curve, (c) The function Eq. Q for a set of five sugar-fatty acid 
pairs, that shows a characteristic linear-plateau behavior, (d) The rescaling property Eq. 0. We rescale the uptake 
fluxes of the nutrient pairs shown in panel c with the number of carbons of each nutrient. All the points collapse on 
the same curve. The dotted line corresponds to the function Eq. 0, where we set ^ as the average of the set 

tai ^ta B foi^ sugar-fatty acid pairs. 
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Figure 3: Nutrient synergy contributions. We show the j3' function, Eq. for pairs of four nutrient classes: sugars, 
fatty acids, bases and amino acids. Dashed lines correspond to the function in Eq. (|^ where the parameters {b^j} are 
averaged over all pair of nutrients in the corresponding pair of classes. 
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Table I: Average numerical values of the parameters of the phenomenological model in Eq. (|7|). We show here the 
average slope (&12) and plateau (&21) values of the /3' functions for the cross interactions plotted in Fig. For nutrient 
pairs involving an amino acid we obtain two different plateau values, depending on the metabolic processes in which 
the amino acid participates (see text). If two amino acids are involved, also an additional slope is needed. When the 
pair is inverted, for different nutrient classes, the values of the plateau and and slope are also swapped. Note that we 
order nutrients according to their carbon content and do not consider pair permutations. For this reason, for pairs of 
the same class (e.g. Fatty acids-Fatty acids), values 612 and &12 are not equal; 612 captures growth on media where 
the nutrient with more carbons is in excess, while 612 renders the opposite situation. 



2 

1 

Fatty acids 

Bases 

Amino acids 

Sugars 

bi2 

2.4 X 10-3 

8.8 X 10-"^ 

1.6 X 10-3 

hi 

1.2 X 10-2 

3.1 X 10-2 

2.9 X 10-3 

3.6 X 10-2 

Fatty acids 

^12 

1.4 X 10-^ 

1.2 X 10-2 

1.2 X 10-2 

^21 

3.5 X 10-3 

3.4 X 10-2 

3.9 X 10-3 

4.1 X 10-2 

Bases 

bi2 


7.2 X 10-^ 

3.0 X 10-2 

hi 


1.3 X 10-2 

2.8 X 10-3 

1.3 X 10-2 

Amino acids 

h2 



2. X 10-3 

5.4 X 10-3 

hi 



4. X 10-3 

3.3 X 10-2 


Table II: The metabolic pathways included in the logistic model to predict amino acids groups (H or L). We report in 
the first column the pathway names, sorted for decreasing Bayesian Information Criterion associated with the model. 
In the second column we list the number of amino acids participating in each pathway. 


Metabolic pathway 

No. a. acids 

1. alanine, aspartate and glutamate metabolism 

6 

2. valine, leucine and isoleucine degradation 

2 

3. phenylalanine, tyrosine and tryptophan biosynthesis 

3 

4. sulfur relay system 

2 

5. glycine, serine and threonine metabolism 

7 

6. arginine and proline metabolism 

7 
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No. Uptakes 


(b) 


Figure 4: Second order equitative synergy theory, (a) Predictions of the optimized synergy model (OS) Eq. (|^, empty 
blue squares, versus the FBA results, compared with the IM theory Eq. Q, filled red circles, for 100 different random 
media at increasing number of uptakes (see Methods and Appendix for the details on growth media). Here, we use the 


exact values of parameter d and the average interclass value of parameters b. (b) The relative error A = 


Iffmodel —gPBAl 


9fba 


vs. the number of uptakes for the IM (filled red circles) and the OS model (empty blue squares), averaged over 500 
different random media. The relative error of the IM theory grows almost linearly, while it remains much lower in the 
OS model and becomes roughly independent of the number of uptakes for E > 6. 
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Figure 5: Comparison of the OS model, Eq. (|^, (y axis) with the experimental growth of Beg et ai, (x axis); 
the dashed diagonal indicates perfect agreement. The uptakes corresponding to each experimental growth rate were 
computed (Appendix) and used as an input of the OS model to evaluate the predicted growth. The x error bars are one 
standard error, the y error bars indicate all feasible growths consistent with the uptakes plus/minus their error. We hnd 
a fair agreement between our theory and the experimental measurements, supporting that scaling and synergy are two 

principles regulating also microbial growth in vivo. 
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Appendix A: The metabolic reconstruction 

We use the genome seale E. coli metabolie reeonstruetion iAF1260 [|2^ . Sueh reeonstruetion 
features 1678 metabolites and 2392 reaetions, of whieh 299 are exehange reaetions. The minimal 
medium is composed by 18 essential nutrients Ca2, cobalt2, Cu2, Zn2, Mn2, cbll, H 2 O, Pi, H, 
K, Cl, Fe2, Fe3, mobd, Nal, Nh4, So4, Mg2 The fluxes of the reactions that uptake these 
nutrients are always kept different from zero. In our analysis we assume nutrient uptakes are 
known. Thus we focus exclusively on the 63 exchange reactions delivering sugars (22 reactions), 
fatty acids (6 reactions), amino acids (26 reactions), and bases (9 reactions) to the bacterium (see 
Table [rn]), and keep all other exchanges locked to zero. 


Appendix B: Flux Balance Analysis 


Flux Balance Analysis (FBA) is a mathematical tool to predict, under certain assumptions, the 
fluxes v> and the biomass production gpBA of a metabolic network [|9l. Given the stoichiometry 
S of the network, FBA aims at finding the solution of the metabolic mass balance equation under 
steady state condition. Denoting by c the vector of metabolic concentration, FBA seeks thus to 
solve the system of linear equations: 

c = Siy = 0. (Bl) 


Since in real metabolic networks there are much more reactions than metabolites, the above system 
is underdetermined and it allows several solutions. From the space of solutions, physiologically 
relevant points are usually selected by coupling the mass balance problem Eq. ( |B1| ) with an opt- 
mization principle. Quite generally, thus, a FBA problem seeks solutions to Eq. (Bl) such that a 
linear objective function Z of the form 




(B2) 


with Tfc some positive constants, is maximized. The objective function is often related to the 
biomass production. In our case we focus solely on the maximization of biomass polymerization, 
so that we have one flux only appearing in the sum Eq. ( |B2[ ) (which expresses the biomass synthe¬ 
sis) and we can assume Z = ^^fba- Einally, we note that when essential nutrients are assumed to 
available in excess, Eq. (B1) specifies a linear problem that is defined up to multiplicative constant: 
any solution to Eq. (B1) may be rescaled through a constant factor and still be a valid solution. We 
therefore keep uptakes in arbitrary units when validating our model against EBA. 


Appendix C: Generation of the growth media 

We focus only on nutrients that can be uptaken by the organism and produce growth lfT9]| . The 
growth media we generate therefore only contain sugars, fatty acids, amino acids, and bases. Since 
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Table III: The 63 uptake fluxes considered in our study. We include uptakes delivering sugars (22 reactions), fatty 
acids (6 reactions), amino acids (26 reactions), and bases (9 reactions) to the bacterium. 


Sugars 

Fatty acids 

Amino acids 

Bases 

1. L-Arabinose 

14. Maltose 

1. Octanoate 

1. Glycine 

14. D-Metbionine 

1. 

Allantoate 

2. L-Lyxose 

15. Melibiose 

2. Decanoate 

2. D-Alanine 

15. L-Metbionine 

2. 

Cytosine 

3. D-Ribose 

16. Sucrose 

3. Dodecanoate 

3. L-Alanine 

16. Ornithine 

3. 

Uracil 

4. D-Xylose 

17. Trehalose 

4. Tetradecanoate 

4. D-Cysteine 

17. L-Proline 

4. 

Adenine 

5. L-Xylulose 

18. Maltotriose 

5. Hexadecanoate 

5. L-Cysteine 

18. L-Valine 

5. 

Guanine 

6. D-Allose 

19. Maltotetraose 

6. Octadecanoate 

6. D-Serine 

19. L-Arginine 

6. 

Hypoxan thine 

7. D-Fructose 

20. Maltopentaose 


7. L-Serine 

20. L-Histidine 

7. 

Orotate 

8. L-Fucose 

21. 1-4-Qf-D-glucan 


8. L-Asparagine 

21. L-Isoleucine 

8. 

Thymine 

9. /3-D-Galactose 

22. Maltobexaose 


9. L-Aspartate 

22. L-Leucine 

9. 

Xanthine 

10. Galactose 



10. L-Homoserine 

23. L-Lysine 



11. D-Mannose 



11. L-Tbreonine 

24. L-Phenylalanine 



12. L-Rbamnose 



12. L-Glutamine 

25. L-Tyrosine 



13. Lactose 



13. L-Glutamate 

26. L-Tryptophan 




multiple uptake of sugars is not observed [l26ll . we allow for the exehange of one sugar only and 
randomly allow all other nutrients to be uptaken by the baeterium. Summing up all the exehange 
fluxes listed in Seej^ eaeh growth medium ean therefore be eomposed of 42 nutrients at the most 
(i.e. one sugar and 41 other nutrients), plus the 18 nutrients in the minimal medium. 

As the minimal medium is always ineluded, just eonsidering the 22 sugars and the 41 remaining 
nutrients, for eaeh growth medium we henee have a 63-dimensional random veetor of exehange 
fluxes <p whieh, for any fixed number of uptakes E, is generated as follows (see Fig. for a 
pictorial representation of the growth media): 

• Only one of the 22 entries delivering sugars is uniformly chosen at random. We randomly 
fix its value uniformly in the set cp^ug ^ (0., 1.) arb.units. 

• The remaining E — 1 uptakes are uniformly drawn at random among the 41 entries of 0 that 
do not correspond to a sugar. The value of each flux is again uniformly drawn at random in 
the set (0., 1.) arb.units. 

• The E nonzero entries of 0 are normalized so that 0i = 1 arb.units 

In all the complex growth media we generate we always include the essential nutrients, which are 
assumed to be present in excess, i.e. they are uptaken at a rate 1 x 10^ arb.units, equivalent to 
infinite uptake rate in the metabolic reconstruction. 
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Figure 6: Illustration of how random media are generated. Besides the minimal medium, we only consider growth 
on sugars, fatty acids, amino acids, and bases. Each random medium we generate only contains one sugar (the purple 
filled arrow), plus a set of other nutrients. The sugar and the remaining nutrients are all uniformly chosen at random. 
These nutrients and their uptake value form a random vector of exchange fluxes cj). In the figure we sketch as filled 
arrows all the nutrients included in the random medium and as empty arrows the ones not considered. For any random 
medium considered, uptakes are normalized so that 4’i = ^ arb.units.. 


Appendix D: Selection of the minimal model for the growth on amino acids 

When studying nutrient-class-wide pairwise interactions involving amino acids, we noticed 
that the jS' functions appearing in Fig. [^tended to acquire two plateau values. We hence divided 
the amino acids into sets H and L, according to whether their corresponding /?' plateau value was 
above or below 10“^, respectively. 

By doing this, we observed that the pathways that process a given amino acid correlate in some 
way with its associated (3' plateau values. Indeed, as we show in Fig. |7} many metabolic pathways 
feature either amino acids belonging to only one set, or a far exceeding number of amino acids in 
one of the two sets. 

We thus opted to predict whether a given amino acid belonged to group H (or L) by exploiting 
the minimum information on the metabolic processes it participates in. We developed a linear 
model TTj for each amino acid i and used logistic regression to estimate the probability Vi{i G 
HliTi} for metabolite i to belong to group H given model vTj. Considering a set Ai of n metabolic 
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Number of Amino acids 

4 6 


abc transporters 
aminoacyl-trna biosynthesis 
biosynthesis of secondary metabolites 
cyanoamino acid metabolism 
aianine, aspartate and giutamate metaboiism 
arginine and proiine metaboiism 
microbiai metaboiism in diverse environments 
nitrogen metaboiism 
glycine, serine and threonine metabolism 
d-glutamine and d-glutamate metaboiism 
d-alanine metaboiism 
cysteine and methionine metaboiism 
peptidoglycan biosynthesis 
two-component system 
bacterial chemotaxis 
vitamin b6 metaboiism 
beta-aianine metabolism 
pyrimidine metaboiism 
c5-branched dibasic acid metaboiism 
histidine metaboiism 
taurine and hypotaurine metaboiism 
lysine biosynthesis 
porphyrin and chlorophyll metabolism 
methane metabolism 
phenylaianine, tyrosine and tryptophan biosynthesis 
giutathione metabolism 
butanoate metabolism 
folate biosynthesis 
seienocompound metaboiism 
sphingolipid metaboiism 
novobiocin biosynthesis 
nicotinate and nicotinamide metaboiism 
valine, ieucine and isoieucine biosynthesis 
pantothenate and coa biosynthesis 
purine metaboiism 
suifur relay system 
sulfur metabolism 
glycerophospholipid metabolism 
phenyiaianine metabolism 
tyrosine metaboiism 
lysine degradation 
thiamine metaboiism 
tryptophan metabolism 


□ LSet 

□ H Set 


Figure 7: Number of amino acids in sets H and L for each metabolic pathway. We see that the amount of amino 
acids in each set is uneven in the majority of pathways, with most of them only featuring amino acids in the L set. We 
opted to exploit this characteristic to predict to which set each amino acid belongs to and automatically assign it a /3' 

plateau value. 


pathways, we assumed 


— 'bO 


Viii G = 




(Dl) 


1 + exp TTj ’ 


where the sum runs over the n pathways in A^. In Eq. (Dl) X/ is a binary variable taking value 1 
if amino aeid i partieipates to pathway j and 0 otherwise. All eoeffieients {k}j=i have real values. 
For eaeh set M. we estimate by miximizing the likelihood C = ni=i The eoeffieient 

^0 is related to the probability that an amino aeid i belongs to H while not partieipating to any 
pathway in vTj. As we aim to gain the maximum predietive power by exploiting the minimum 
information, we opted to seek for the smallest set M that yields the largest rate of eorreet guesses. 
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that is, which returns Vi larger than 0.5 for metabolites aetually belonging to H in the majority of 
eases. The minimum set may be found by minimizing the Bayesian information eriterion (BIC) 
Il23l , vir. 


BIC = (n + l)logiV-21og£, (D2) 

where n = || || is the size of the set Ai (i.e. the number of ineluded pathways), N is the number 

of amino aeids and C is the likelihood that the observed H, L sets are generated by models 

To seek for the minimal M., we started out with zero pathways and then used an iterative 
greedy approaeh that at eaeh step added the pathway that yielded the minimum BIC, that is, that 
maximized the likelihood C. The result of this iterative approaeh is shown in Fig. the first 
point features one metabolie pathway and renders a BIC elose to 30. Adding parameters (i.e. 
adding metabolie pathways) lowers the BIC up to n = 6 where there is no more signifieative gain 
in predietive power and adding more pathways only overfits the model, so that the BIC starts to 
grow. The whole analysis was performed using R (version 2.15.3 [|2^ 1. 

Onee we knew the profile of the BIC, we retained the set M. that minimized it. Sueh set is the 
best trade off between the likelihood C (i.e. the predietive power) and the number of pathways 
ineluded in the model. The six pathways ineluded in the final M. yielded a BIC = 27.3 and are 
listed in Table [TVl where we also report the BIC returned by all models featuring n < 6 pathways 
and the number of amino aeids partieipating in eaeh pathway ineluded. 

In Fig. 1^ we show the probabilities Vi{i G L\Tii) as a funetion of the number of pathways 
n in the model tt^. In our analysis we fix a threshold of 0.5 and assume metabolite i belongs to 
HifV, > 0.5 and i E S otherwise. The green shaded area in Fig. [^indieates the region where 
we expeet Vi to lie: for the vast majority of the amino aeids only a few parameters in the tt* 
are suffieient to elassify all amino aeids into sets L or H. For the ease n = 6 pathways, whieh 
minimizes the BIC, we see that there is only one amino aeid whieh is not eorreetly elassified, 
namely D-Methionine (met_D). All the rest of the amino aeids are eorreetly assigned to either L or 
H by only inspeeting whether they partieipate in the metabolie pathways listed in Table [TVl 
Sinee knowing whether a given amino aeid partieipates to these six pathways is suffieient to 
know where its assoeiated f5' plateau will lie, we deeided to model the fi' funetions through their 
phenomenologieal form Eq. ([^ and assign two possible values to parameters b, whieh are evalu¬ 
ated by averaging /)' eorresponding to amino aeids in the sets H and L separately. 


Appendix E: Optimal synergy in the second order model 

As shown in See. the IM model systematieally underprediets growth rates in presenee of 
multiple nutrients. As a result we have to inelude a synergy term in our model. We do so by 
introdueing the (3' funetions. However, we find that an equal eontribution of all synergisite terms 
overprediets the growth rate in eomplex media (see fig[T^. This is beeause resourees are limited 
and not all nutrient pairs ean develop sueh maximal synergy. We therefore eall this a naive eq- 
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Number of pathways, n 


Figure 8: The Bayesian Information Criterion as a function of the number of pathways n. Starting with zero pathways, 


we iteratively incorporated into the model Eq. (D11 the metabolic pathway that yielded the minimum BIC. This allows 
to gain predictive power and to lower the BIC up to n = 6 pathways (black arrow). Inclusion of further information 
does not enhance the predictive ability and only overfits the model. 


Table IV: The six pathways included in the model tt that minimizes the Bayesian Information criterion. We report 
in each row the name of the pathway, the number of amino acids participating in it, and the BIC value of the model 
containing all pathways up to the row, so that the last line has the minimum BIC value. 


BIC 

Metabolic pathway 

no. a. acids 

34.0 

alanine, aspartate and glutamate metabolism 

6 

33.3 

valine, leucine and isoleucine degradation 

2 

31.3 

phenylalanine, tyrosine and tryptophan biosynthesis 

3 

29.8 

sulfur relay system 

2 

29.1 

glycine, serine and threonine metabolism 

7 

27.3 

arginine and proline metabolism 

7 


uitative synergy (NES) model, that assuming maximal synergy among all nutrients deseribes an 
unrealistie seenario. 

In order to limit the overall synergy, we tested the equitative synergy (ES) theory, where re- 
sourees are equally distributed aeross the nutrient pairs. We ereated eomplex growth media as 
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Figure 9: The probabilities Viii S H\Tri) of each amino acid i varying the number of pathways n included in 
the model The shaded green area highlights the expected region where Vi should lie, i.e. Vi G [0,0.5] and 
Vi G (0.5,1] for amino acids in sets L and H respectively. For the majority of them, the inclusion of only a few 
pathways in Wi is enough to predict the correct set. When n = 6, that is, when the BIC is minimum, we correctly 
capture the behavior of all amino acids except for D-Methionine (metJD). 
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explained in See. with eaeh medium k eonsisting of nutrients and thus = Pk(Pk — 1)/2 
possible pairs. We then assumed that, for eaeh nutrient i, the uptake was equally invested in 
the Pk — 1 synergies sueh nutrient ean develop. Therefore, we eomputed the ES model growth on 


medium k by eorreeting the IM theory with the jS' eontributions Eq. Q as: 



(El) 


Here is the IM theory growth, Eq. Q, cTj is the elass of nutrient i, while the sum runs on the 
Pk possible nutrient pairs. Henee, with faetor 1/(Pk — 1), we equally spread aeross the E^ — 1 
synergies. 

The resulting model shows an improvement respeet to the IM theory, although the gain de- 
ereases when the number of uptakes grows. 

The deerease in aeeuraey for inereasing E of both the NES and the ES model suggests that the 
uptake of resourees is distributed in some optimal way. Sinee in the EBA approaeh metabolism is 
aimed at growth optimization, we hypothesized that uptakes are organized in sueh way to maxi¬ 
mize the nutrient synergistie eontributions to growth. Speeifieally, sueh optimality must be reaehed 
by eonsidering that nutrient uptakes that are invested to attain a eertain synergy may not eontribute 
to another synergy. In Eig. one elearly realizes how this ean be taken into aeeount. Indeed, the 
funetions shown in Eig. [^typioally have a growing regime followed by a 
plateau. The appearanee of the plateau means that the synergy is not affeeted by a variation of the 
uptake of nutrient ni, i.e. nutrient rii is in excess with respeet to nutrient 77 - 2 . Conversely, in the 
growing region, the situation is reverted and nutrient 772 is in exeess. The point T(77i, 772 ) = & 21/&12 
marks the transition from one regime to the other. Thus, if Cmfpm < C'n20n2(^2i/(^i2> nutrient 772 is 
in exeess: in sueh ease, 77i has been eompletely invested and it eannot be used in other synergies, 
while 772 can only eontribute further with an effeetive flux Cn 20 n 2 = C'n 20 n 2 ~ C'ni0niP(^i, ^ 2 ) 
1^ that is, with the surplus of its uptake. 

We henee devised the following method to aehieve optimality in the ease of limited resourees 
on eomplex growth media: 

1. Eor eaeh pair of nutrients i, j and eorresponding uptake fluxes (pi, (pj eompute the seeond 
order eorreetion Agij to the IM growth: 



tanh 


(E2) 


where cXi and Ci are the elass and the earbon eontent of nutrient i, respeetively. 

2. Rank all Ag^j from largest to smallest. The first in sueh rank will be the best eontribution to 
aeeomplish optimal growth. 

3. Add to the IM growth predietion the first eorreetion in the rank. 


^ CniP'n^ = Cn^cjin^ — /T(ni, 772 ), pn2 = 0 respectively if nutrient 77i is in excess 
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Figure 10: Second order model predictions, (a) Prediction of model bacterial growth against FBA results, for four 
models (see text): IM, NES, ES, OS. The idealized metabolism (IM, red circles) captures reasonably well EBA growth 
predictions. Including maximal synergy for all the nutrient pairs with a naive equitative synergy theory (NES, purple 
up triangles) largely overestimates the EBA growth. Considering a uniform uptake for all nutrient pairs with the 
equitative theory (ES, green diamonds) improves the IM results. When the number of uptakes is ^ 1, all these 
models produce worse results than the Optimized Synergy model (OS, blue squares), (b) The relative error A of the 
different models as a function of the EBA growth ^fba- The baseline is the first order IM theory (red circles), with 
a relative error that increases roughly linearly with the number of uptakes. The NES model (purple up triangles) is 
clearly unrealistic, with a relative error that increases very fast. The ES model (green diamonds), conversely, improves 
the IM results, although its A still increases with the number of uptakes. The OS model error (blue squares) remains 
very low and depends very weakly on the number of uptakes, suggesting optimal allocation of synergies is a robust 

explanation for maximal growth. 
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4. Reduce fluxes 0j and (pj, so to take into account that some uptake of nutrients i and j has 
been invested into their synergy: 

(a) For the nutrient in excess, say j, set cpj —>• cpj — Ci/ 

(b) Set (pi —)■ 0, as uptake of i has all been used to develop synergy Agij. 

5. Remove from the rank all synergies involving nutrient i, as its effective uptake is now zero. 

6. Re-compute the synergies {Agkj} with the new uptake flux <pj. 

7. Optimal synergy (OS) model: go to step[^ 

The process is iterated until no uptake flux can be diminished further. 

The above strategy to pinpoint optimal allocation of resources is really effective. The OS model 
gives very accuarate results even for a large number of uptakes and we thus opted for it. 

Note that the results presented are derived assuming that a sugar is always present in the 
medium. One can generalize and also work with sugar-free complex growth media. Because 
(3'{x) functions for (fatty acid, base), (fatty acid, amino acid), and (amino acid, amino acid) inter¬ 
actions are not perfectly captured by Eq. (|7]) when a; ~ 1, this scenario is better captured allowing 
for two different slopes of the beta functions: results for the OS model are slightly less accurate 
than in presence of sugars, but still far better than the IM, as shown in Fig. 



9fba 



Figure 11: (a) Predictions of the OS model (blue open squares) vs the IM model (red filled circles), for complex 
media that may not include sugars. To better capture non-sugar synergies we allow here 2 different slopes to the /3 
functions (b) The relative error A of the OS model (blue empty squares) and the IM model (red filled circles). Also 
when sugar are not always uptaken the OS model has a consistently smaller relative error than the IM model. 
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Appendix F: Comparison with the experiments 

Beg et al. [I^ published a few years ago a study that proves to be an exeellent means to eontrast 
our model against experimental results. In their work, the authors measured at high frequeney the 
growth rate of a bateh eulture of E. coli and the eorresponding variation of nutrient eoneentration 
in the medium, simultaneously. Additionally, they included in their paper measurements of the 
culture optical density and other quantities of interest. Ah the relevant measurements for our 
analysis are reported in Ref. [l20ll Fig. 2, panels a and b: in the following, we explain how to 
integrate such data in our approach. 

The hrst step to make the results of Beg et al. useful in our framework is to calculate, for each 
nutrient i, the uptakes given the time evolution of nutrient concentration Cj(f) reported in Fig. 
2b of Ref. [l20ll . For each nutrient i, the uptake (pi is related to the time derivative of the nutrient 
concentration q as: 

(FI) 

where is the molar mass of nutrient i, D{t) the microbial dried mass at time t and Vw is the 
working volume, which is provided by the authors in the supporting material of Ref. fl20ll (note 
indeed that concentration are provided per unit volume in Il20l l. This relation properly yields 
uptakes in mmol gDW“^ h“^, the units commonly applied in metabolic reconstructions and that 
we use in our model. 

From Eq. ( [fT] ), we see that, to compute <pi(t), first the derivatives Cj must be evaluated from the 
provided curves Ci(t), for each nutrient i. This is straightforward and can be accomplished with, 
e.g., centered differences. For each value Ci(t) we also compute the error evaluating the 
maximum and minimum slopes compatible with the given error bars of Cj(f), also reported in Fig. 
2b of Ref. (201. 

The second quantity to evaluate in order to calculate the uptakes is the dried weight E(t). 
We assume it to be proportional to the optical density 0(t), which is given in Fig. 2a of Ref. 
(20l . Knowing the initial optical density 0(0) and dried weight 0(0) (which is specified to be 
6.75 X 10“^ g), we are hence able to compute the whole D(t) curve, with its own error a£)(t) 
(evaluated from the known error on the optical density). 

After the above step, we are able to compute the uptakes (pi(t) and their associated errors 
(propagating aci{t) and aoit)), for each nutrient i and time t. Note that we do not allow 
negative uptakes (corresponding to nutrient release, really) and we discard noisy fluctuations of 
Ci{t) allowing for unexpected multiple nutrient uptakes at f < 3.5h. Consequently, (j)i{t) = 0 with 
zero uncertainty for all nutrients except glucose when t < 3.5h. The resulting uptakes are plotted 
in Fig. [T^ . 

Knowing all uptakes for each time t, we finally compute the growth gosit) predicted by the 
OS model by using Eq. (|^. We also derive an associated error <Jgos{t) by evaluating the growth 
rates yielded by the minimum = {(/>j(f) — i G nutrients} and maximum = 

{<pi{t) + G nutrients} possible uptake vectors, respectively. Therefore, in turn, <Jgos{t) = 
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9os{<P max (t)) -9 os{4> min («)). 

Albeit the experimental growth rate is partially provided in Fig. 2a of Ref. [[201 . we opt to 
calculate the experimental growth rate 5'expt(^) resulting from our estimate of the experimental 
dried weight curve D{t). The rationale is to have a 9expt{t) consistent with the D(t) values used 
to compute the uptakes. Note indeed that in Fig. 2a of Ref. [[20l the entire time series of the 
experimental growth rate is not available (i.e. time window t = 0 to f = 1.5h is missing), so we 
cannot proceed the other way around and estimate D{t) integrating back the growth rate. Hence, 
we evaluate (y'expt(f) from the differential equation: 


9expt(t) 


Dity 


(F2) 


that fixes the evolution of the dried weight in exponential growth condition. Again we estimate 
ID{t) from D{t) with centered differences and its error analogously to what done for ac{t). 
Finally, we compute the error for (7expt(f) by propagating and anit)- The growth 

rates (y'expt(f^) we find are entirely consistent with the ones originally published in Fig. 2a of Ref. 
[|20l . as shown in Fig. [T^ . However, as said, such 9expt{'t) values are more coherent with the dried 
weight we used in Eq. ( |FT| ) to compute the uptakes, so these are the ones we plot in Fig. 

Having computed 9osit) and 9exptit), we finally compare them in Fig. finding an excellent 
agreement. To obtain these accurate results, we use Eq. Q to estimate the value oif each a. In Eig. 
13 we show how results change when using the exact a values instead: the predictions are only 
slightly better. This finding is remarkable, because to use Eq. Q we only need to use the slopes 
ac (Fig. [^) and the carbon content of each nutrient, rather than the actual yield. The Oc values 
hold for all nutrients in a given class, while the carbon content of nutrients is generally known, so 
that Eq. Q can be readily applied to diverse situations without having to reevaluate single nutrient 
contributions to growth. 

Note that in these two validations against experimental results we only focus on the truly expo¬ 


nential growth phase, i.e. where tge^ptit) ^ which is the shaded region in Eig. 12 


A final remark on the fact that the experimental growth medium contains lactate and glycerol, 
which do not belong to nutrient classes we discuss presently. Again, one can proceed as we outline 
in Secs. [n|and [nrA] to evaluate parameters a and b for the classes corresponding to these nutrients. 
Eor organic acids, the class lactate belongs to, we find aorg_ac = 1-5 x 10“^, while b parameters for 
all cross interactions are reported in Table |Vj Eor glycerol, we opt instead to use the same a and b 
parameters we derived for fatty acids, which do yield accurate results already. 
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Figure 12: (a) The experimental uptakes 4> computed via Eq. (FI i, for the five nutrients considered in Fig. 2b of Beg 
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growth time window (f^expt {t) ^ 1), where we pick the points plotted in Fig. (b) Comparison of the growth rate 
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Figure 13: Model prediction of experimental growthg rates. We compare here the accuracy 
predicting experimental bacterial growth rates when using Eq. 0 to estimate the a parameters 
using the exact values of a (blue squares), which are evaluated by estimating the nutrients yield. Eq. ([^ performs 
fairly well, its predictions being only slightly worse than the ones obtained with the exact ds. This is remarkable, as 
it implies that, when dealing with physiological values, one can accurately predict growth rates by only knowing the 
slope Oc of each nutrient class and the carbon content of each nutrient, respectively, rather than the exact yield. 


of model Eq. (|9j at 
(red circles) and by 


Table V: The OS model b parameters for synergies with organic acids (org_ac in the Table). Interactions with amino 
acids again allows for two different plateau values of the /3' function. Nutrients are always sorted for increasing 
carbons; organic acids intra-class interaction does not consider pair permutations and yields thus two different b 
values: 6org_ac other Corresponds to growth on a medium where large carbon content organic acids are in excess, while 

Mother org_ac Captures the opposite situation. 


other 

^org ac other 

Mother org ac 

Sugars 

3.0 X 10-3 

1.7 X 10-3 

Fatty acids 

3.4 X 10-3 

1.1 X 10-2 

Organic acids 

2.7 X 10-3 

4.0 X 10-3 

Bases 

2.4 X 10-3 

2.4 X 10-2 

Amino acids 

3.0 X 10-3 

3.0 X 10-3 1.8 X 10-2 
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